Abstract

Background: The type II clustered, regularly interspaced, short palindromic repeat (CRISPR)/CRISPR-associated
protein 9 (Cas9) system is a novel molecular tool for site-specific genome modification. The CRISPR-Cas9 system 
was recently introduced into plants by transient or stable transformation. 

Findings: Here, we report gene targeting in rice via the Agrobacterium tumefaciens-mediated CRISPR-Cas9 system. 
Three 20-nt CRISPR RNAs were designed to pair with diverse sites followed by the protospacer adjacent motif 
(PAM) of the rice herbicide resistance gene BEL. After integrating the single-guide RNA (sgRNA) and Cas9 cassette in 
a single binary vector, transgenic rice plants harboring sgRNA:Cas9 were generated by A. tumefaciens-mediated 
stable transformation. By analyzing the targeting site on the genome of the corresponding transgenic plants, the
mutations were determined. The mutagenesis efficiency varied from ~2% to ~16%. Furthermore, phenotypic
analysis revealed that the biallelic mutated transgenic plant was sensitive to bentazon.

Conclusions: Our results indicate that agricultural traits can be purposely modified by sgRNA:Cas9-induced
gene targeting. The CRISPR-Cas9 system could be exploited as a powerful tool for trait improvement in crop breeding.

Keywords: Gene targeting; CRISPR/Cas9; Rice; Agrobacterium; Stable transformation; Genome editing



Findings 

Increases in the human population as well as in extreme
weather events require the sustained improvement of
crop varieties. A directed, rapid, and low-cost method is
critical for updating high-yield, multi-stress resistant var- 
ieties. Molecular marker-assisted selection (MAS) and 
genetic modification (GM) methods have advantages in 
capturing favorable agricultural traits. However, the tar- 
geted editing technology of key functional genes promises 
to be a powerful tool in accelerating varietal improvement.
Zinc finger nucleases (ZFNs) and transcription
activator-like effector nucleases (TALENs) have proven to
be effective in plant-targeted genome editing (Chen and 
Gao 2013; Li et al. 2012; Shan et al. 2013a; Zhang et al. 
2013); however, a simple, affordable, and high-throughput 
method is still needed. Recently, the bacterial type II 
clustered, regularly interspaced, short palindromic repeat 
(CRISPR)/CRISPR-associated protein (Cas) system has
attracted attention due to its ability to induce
sequence-specific genome editing. The site specificity is
defined by the complementary base pairing of a small
CRISPR RNA (crRNA). After annealing to a trans-activating
crRNA (tracrRNA), the crRNA directly guides the Cas9
endonuclease to cleave the target DNA sequence. In humans,
zebrafish, Drosophila, mice, and rats, genome editing can
be achieved by simply combining an engineered
Streptococcus pyogenes Cas9 (SpCas9) and a synthetic
single-guide RNA (sgRNA) consisting of the crRNA and the
tracrRNA (Cong et al. 2013; Feng et al. 2013; Gratz et al. 
2013; Li et al. 2013a; Wang et al. 2013; Xiao et al. 2013). 
The CRISPR/Cas9-induced genome editing was also 
recently performed in model plants (e.g., Arabidopsis, 
Nicotiana benthamiana) and crops (including wheat, 
rice and sorghum) by transient or stable transforma- 
tion (Belhaj et al. 2013; Feng et al. 2013; Jiang et al. 
2013; Li et al. 2013b; Mao et al. 2013; Miao et al. 
2013; Nekrasov et al. 2013; Shan et al. 2013b; Xie and 
Yang 2013). Here, we demonstrate that the CRISPR/Cas9 
system could achieve efficient gene targeting in rice and 
modify the corresponding agricultural trait by
Agrobacterium tumefaciens-mediated stable transformation.

The rice Bentazon Sensitive Lethal (BEL,
LOC_Os03g0760200) gene confers resistance to bentazon and
sulfonylurea herbicides. The loss-of-function mutant bel 
is sensitive to the herbicides (Pan et al. 2006). In two- 
line hybrid rice production, when male sterile lines are 
developed in the bel background, the problem of hybrid 
seed contamination by the selfing of sterile lines can be 
solved by simply spraying bentazon at the seedling stage. 
Although BEL has the potential to improve hybrid rice 
production safety, the limited natural genetic resources 
greatly restrict its application. Therefore, we selected
the BEL gene as the target for sgRNA:Cas9-based disruption.
A 20-nucleotide (nt) region (target-1) at the 5' end of a
protospacer adjacent motif (PAM) in the second exon of BEL
was selected as the editing target (Additional file 1: Method;
Additional file 2: Figure S1). Complementary
oligonucleotides were synthesized and fused to the default
sgRNA scaffold (Mali et al. 2013) under the control of an
Arabidopsis U6-26 gene promoter (U6-26p). A double 35S
promoter (pd35s) was used to drive a plant codon-optimized
SpCas9 (pSpCas9) coding region. To improve the efficiency
of delivery, both the sgRNA and the Cas9 expression
cassettes were subcloned into a single binary vector
(Figure 1a; Additional file 3: Figure S2). The construct was
introduced into the embryonic calli of japonica rice cv.
Nipponbare via A. tumefaciens, and independent transgenic
events were isolated in the presence of hygromycin
(Additional file 1: Method). The target region was amplified
from the genomic DNA of transgenic plants and
Sanger-sequenced to detect sequence alterations. Fifteen
mutations, including nucleotide deletions and
substitutions, were identified in 14 lines from 90
independent transgenic plants (Figure 1b), indicating
sgRNA:Cas9-induced gene targeting in rice.
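
The target selection rule described above (a 20-nt protospacer immediately followed by an NGG PAM) can be illustrated with a short script. This is only a sketch, not the procedure used in the study: the function name find_targets is hypothetical, the input is the 45-nt top-strand sequence shown in Figure 1a, and only the given strand is scanned (a real search would also scan the reverse complement).

```python
import re

# Top-strand sequence around target-1, copied from Figure 1a.
REGION = "GCAAGCCGAGATCGACGCATCCGTCGGCAACTCTCGCCTGATCAC"


def find_targets(seq, guide_len=20):
    """Return (start, protospacer, pam) for every N20-NGG site on this strand.

    `start` is the 0-based index of the first protospacer nucleotide.
    """
    # The lookahead keeps overlapping candidates that a plain search would skip.
    pattern = re.compile(r"(?=([ACGT]{%d})([ACGT]GG))" % guide_len)
    return [(m.start(), m.group(1), m.group(2)) for m in pattern.finditer(seq)]


for start, protospacer, pam in find_targets(REGION):
    print(start, protospacer, pam)
```

On this short region the only candidate is the target-1 protospacer GCCGAGATCGACGCATCCGT with the PAM CGG.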

To determine the effect of sgRNA:Cas9-directed gene
targeting, we investigated the herbicide resistance of the
mutated plants. A transgenic line with biallelic
mutations (a 1-nt deletion of the 17th "C" and a 4-nt deletion
of the 14th "CATC", as shown in the first and the fourth
mutated sequence in Figure 1b) was selected. The
biallelic mutated plant and non-mutated transgenic lines were
sprayed with bentazon. After 7-day incubation, the
control leaves from the non-mutated transgenic plants
remained green; however, the biallelic mutated leaves
became severely wrinkled and yellowed (Figure 1c),
confirming the functional disruption of the BEL gene.
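
The two alleles of the biallelic line can be reproduced from the target-1 sequence with a few lines of code. This is a minimal sketch under stated assumptions: positions are counted from the first nucleotide of the 20-nt target, "the 14th CATC" is read as a 4-nt deletion starting at position 14, and the helper names delete and is_frameshift are hypothetical. Because the target lies in an exon, a deletion whose length is not a multiple of 3 is expected to shift the reading frame.

```python
TARGET_1 = "GCCGAGATCGACGCATCCGT"  # 20-nt target-1 sequence from Figure 1a


def delete(seq, position, length):
    """Delete `length` nucleotides starting at the 1-based `position`."""
    start = position - 1
    return seq[:start] + seq[start + length:]


def is_frameshift(deleted_nt):
    """A deletion shifts the reading frame unless its length is a multiple of 3."""
    return deleted_nt % 3 != 0


allele_a = delete(TARGET_1, 17, 1)  # 1-nt deletion of the 17th "C"
allele_b = delete(TARGET_1, 14, 4)  # 4-nt deletion of "CATC" from the 14th nt

print(allele_a, is_frameshift(1))
print(allele_b, is_frameshift(4))
```

Both resulting sequences agree with the first and fourth mutated sequences listed for target-1 in Figure 1b.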

To investigate factors influencing the mutagenesis
efficiency of sgRNA:Cas9 in rice, gene targeting was
performed on 2 regions (target-1 and target-2, the latter a
20-nt region located in the first exon of BEL; Additional
file 2: Figure S1) using 2 codon types of Cas9
sgRNA guide sequence (followed by the gRNA scaffold):

5'-GCCGAGATCGACGCATCCGT

3'-CGTTCGGCTCTAGCTGCGTAGGCAGCCGTTGAGAGCGGACTAGTG-5'
5'-GCAAGCCGAGATCGACGCATCCGTCGGCAACTCTCGCCTGATCAC-3'

Target 1 sequence: GCCGAGATCGACGCATCCGT (top strand, 5' to 3'); PAM: CGG

Target-1 site on BEL with stably transformed sgRNA:Cas9

sgRNA:pSpCas9 (14/90) 15.6% mutated

5'...GCGCAAGCCGAGATCGACGCATCCGTCGGCAACTC...3'  WT
5'...GCGCAAGCCGAGATCGACGCAT-CGTCGGCAACTC...3'  -1 (×4)
5'...GCGCAAGCCGAGATC-ACGCATCCGTCGGCAACTC...3'  -1
5'...GCGCAAGCCGAGATCGACGCAT--GTCGGCAACTC...3'  -2 (×2)
5'...GCGCAAGCCGAGATCGACG----CGTCGGCAACTC...3'  -4
5'...GCGCAAGCCGAGATCGACGCAT--ATCGGCAACTC...3'  -3/+1
5'...GCGCAAGCCGAGATCGACGCATCTGTCGGCAACTC...3'  m1
5'...GCGCAAGCCGAGCTCGACGCATCCGTCGGCAACTC...3'  m1
5'...GCGCAAGCCGAGATCGAAGCATCCGTCGGCAACTC...3'  m1
5'...GCGCAAGCCGAGATCGACGCATCGGTCGGCAACTC...3'  m1
5'...GCGCAAGCCGAGATCGACGCATATGTCGGCAACTC...3'  m2
5'...GCGCAAGCCGAGATCGACGCAACGCTCGGCAACTC...3'  m3

sgRNA:hSpCas9 (2/96) 2.1% mutated

5'...GCGCAAGCCGAGATCGACGCATCCGTCGGCAACTC...3'  WT
5'...GCGCAAGCCGAGATCGAAGCATCCGTCGGCAACTC...3'  m1
5'...GCGCAAGCCGAGATCGACGCATGCGTCGGCAACTC...3'  m1

non-mutated

biallelic mutant

Target-2 site on BEL with stably transformed sgRNA:pSpCas9

(2/94) 2.1% mutated

5'...TCGGCCGCGGCAATGGCGGGGCGGCGCGGCTGCCG...3'  WT
5'...TCGGCCGCGGCAATGGCGGG--GGCGCGGCTGCCG...3'  -2
5'...TCGGCCGCGGCAATGGCACTATGGCGCGGCTGCCG...3'  m5

Figure 1 Gene editing in rice using the stably expressed sgRNA:Cas9 system. (a) Diagram of the sgRNA:Cas9 complex targeting the rice BEL
gene at the target-1 site. PAM, the protospacer adjacent motif. gRNA, the guide RNA scaffold (Mali et al. 2013). (b) sgRNA:Cas9-induced rice BEL
gene mutations at the target-1 site. Upper: 14 mutants were identified in 90 independent lines transformed with the target-1 sgRNA:pSpCas9. Lower: 2
mutants were identified in 96 independent lines transformed with the target-1 sgRNA:hSpCas9. The yellow shadow marks the target sequence that is
recognized by the crRNA. The deletions, insertions, and nucleotide substitutions are marked in red. (c) The phenotypes of the bel mutants. The
leaves were sprayed with 1.5 g/L bentazon for 7 days. Upper: the transgenic plant without the target mutation; lower: the biallelic mutant with
the site-specific mutation. Scale bar = 1 cm. (d) sgRNA:Cas9-induced rice BEL gene mutations at the target-2 site. 2 mutants were identified in 94
independent lines transformed with the target-2 sgRNA:pSpCas9.
(humanized SpCas9, hSpCas9, vs. pSpCas9). After stable
transformation by Agrobacterium, the target regions of
transgenic plants were screened. At target-1, only 2
mutants were identified from 96 independent lines using
hSpCas9 (2.1% mutated), compared to the 15.6%
mutagenesis frequency using pSpCas9 (Figure 1b). The
significant difference in mutagenesis frequency at
target-1 (P < 0.01, test) might be related to the higher
level of Cas9 protein accumulation in the pSpCas9
transgenic plants than in the hSpCas9 transgenic plants
(Additional file 3: Figure S2), similar to previous results
obtained by transient assays (Li et al. 2013b).
Although sgRNA expression is considered the limiting
factor for effective mutagenesis (Li et al. 2013b), our
result suggests that the accumulation of Cas9 could also
increase the targeting efficiency. The mutagenesis
efficiencies were further determined at target-2. For
the target-2 sgRNA:pSpCas9 construct, 94 independent
transgenic lines were generated, and 2 mutants were
identified (2.1% mutated). Meanwhile, transgenic plants
harboring the target-2 sgRNA:hSpCas9 were generated;
however, no mutant was found among all 96 transgenic
lines (Figure 1d). Theoretically, every (N)20-NGG
in the genome could be recognized as a target site for
sgRNA:Cas9-based gene targeting; however, the
mutagenesis efficiency varied markedly between the 2
targeted sites using the same pSpCas9 (P < 0.01, test).
Target-2 was located in a GC-rich region (85% GC
content at the target site and 71% within the 100-bp
region comprising the target), whereas the GC content of
target-1 was lower (65% at the target site
and 60% within the 100-bp region comprising the
target). In the absence of further statistical data, we
speculate that the mutagenesis efficiency might be correlated
with the nucleotide composition or the epigenetic state
of target regions.
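
The frequencies and comparisons in this paragraph can be rechecked from the reported counts. The sketch below is illustrative only. The name of the statistical test is not legible in this text, so a two-sided Fisher's exact test is used here as an assumption, not as the authors' method; the function names are hypothetical. The GC calculation uses the target-1 sequence from Figure 1a.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1, total = a + b, c + d, a + c, a + b + c + d

    def table_probability(x):
        # Hypergeometric probability of seeing x in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / comb(total, col1)

    observed = table_probability(a)
    lowest, highest = max(0, col1 - row2), min(row1, col1)
    # Add up every table that is no more likely than the observed one.
    return sum(
        p for p in map(table_probability, range(lowest, highest + 1))
        if p <= observed * (1 + 1e-9)
    )


def gc_percent(seq):
    """Percentage of G and C nucleotides in a sequence."""
    return 100 * sum(base in "GC" for base in seq) / len(seq)


# (mutant lines, screened lines) as reported in the text and Figure 1.
COUNTS = {
    "target-1 pSpCas9": (14, 90),
    "target-1 hSpCas9": (2, 96),
    "target-2 pSpCas9": (2, 94),
}


def rate(label):
    mutants, lines = COUNTS[label]
    return round(100 * mutants / lines, 1)


def compare(label_a, label_b):
    (m1, n1), (m2, n2) = COUNTS[label_a], COUNTS[label_b]
    return fisher_exact_two_sided(m1, n1 - m1, m2, n2 - m2)


for label in COUNTS:
    print(label, rate(label), "% mutated")
print("codon type, target-1:", compare("target-1 pSpCas9", "target-1 hSpCas9"))
print("target site, pSpCas9:", compare("target-1 pSpCas9", "target-2 pSpCas9"))
print("target-1 GC content:", gc_percent("GCCGAGATCGACGCATCCGT"), "%")
```

Under this assumed test, both comparisons give P values below 0.01, in line with the significance reported above, and the target-1 GC content comes out at the stated 65%.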

Several reports have suggested that the first nucleotide
of the target sequence should be 'G' if the U6 promoter
is used to drive sgRNA expression (Li et al. 2013b;
Mali et al. 2013). This limitation would largely restrict
potential mutation sites in genome editing. Therefore,
we tested the stringency of the nucleotide at the first
position. A 20-nt sequence beginning with 'A' was
selected and marked as target-3 (Additional file 2:
Figure S1). The gene targeting was performed with the
binary vector containing pSpCas9. Among 96
independent transgenic lines, we identified 2 mutants (Additional
file 4: Figure S3), suggesting that 'G' is not absolutely
required at the first position of the target sequence
driven by the U6 promoter.

Since the specificity of the sgRNA:Cas9 system has become
a major concern for its application (Hsu et al. 2013), the
off-target efficiency was further investigated. After
searching the rice genome using the target-1 sequence, three
highly identical sites were found. Their sequences had
1-base or 3-base mismatches to the target-1 sequence. All
three genomic regions were examined in the target-1
sgRNA:pSpCas9 transgenic lines. However, we did not observe
any mutation at these sites by sequencing (Additional
file 5: Table S1).
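
The kind of search described above (sites that differ from the target-1 sequence by a few mismatches and are followed by an NGG PAM) can be sketched as follows. Everything except the target-1 sequence is an assumption for illustration: TOY_GENOME is a made-up string, not rice sequence; the function names are hypothetical; and only one strand is scanned, whereas a real search would cover both strands of the whole genome.

```python
TARGET_1 = "GCCGAGATCGACGCATCCGT"  # 20-nt target-1 sequence from Figure 1a

# Hypothetical toy sequence: the on-target site plus two near matches.
TOY_GENOME = (
    "TTTT" + "GCCGAGATCGACGCATCCGT" + "CGG"    # on-target site
    + "ATAT" + "GCCGAGATCGACGCATCCAT" + "TGG"  # 1 mismatch
    + "ATAT" + "ACCGAGTTCGACGCATCCGA" + "AGG"  # 3 mismatches
    + "TTTT"
)


def mismatches(a, b):
    """Number of positions at which two equal-length sequences differ."""
    return sum(x != y for x, y in zip(a, b))


def candidate_off_targets(genome, guide, max_mismatches=3):
    """Scan one strand for N20-NGG sites within `max_mismatches` of the guide.

    Perfect matches (the on-target site) are excluded from the result.
    """
    hits = []
    n = len(guide)
    for i in range(len(genome) - n - 2):
        site, pam = genome[i:i + n], genome[i + n:i + n + 3]
        if pam[1:] != "GG":
            continue
        distance = mismatches(site, guide)
        if 0 < distance <= max_mismatches:
            hits.append((i, site, pam, distance))
    return hits


for hit in candidate_off_targets(TOY_GENOME, TARGET_1):
    print(hit)
```

Each reported site would then be amplified and sequenced in the transgenic lines, as was done for the three candidate sites in this study.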

The delivery method of the CRISPR/Cas system would be
an important part of its potential application for crop
improvement. The efficiency of gene targeting using the
biolistic transformation method is relatively higher than
the efficiency obtained in this study (Shan et al. 2013b).
Biolistic transformation frequently leads to multi-copy
insertions in transgenic plants, and possibly causes
higher expression levels of sgRNA and Cas9 protein.
However, the transgenes introduced by A. tumefaciens
usually exhibit a much higher single-site insertion frequency
(Dai et al. 2001), and the insertions should be easier to
locate in the plant genome. Therefore, the genome would
be easier to identify in progenies of A. tumefaciens-mediated
sgRNA:Cas9-targeted rice. In the near future,
the direct application of A. tumefaciens-mediated
sgRNA:Cas9-induced gene targeting to key functional genes would
become a promising biotechnological strategy to facilitate
the breeding of rice and other major crops for which
transgenic technologies are available.

Accession codes 

The accession number of the BEL gene is NCBI GenBank:
DQ341412.

Additional files 

Additional file 1: Method: Plasmid construction and rice 
transformation. 

Additional file 2: Figure S1. Target loci in rice BEL gene.

Additional file 3: Figure S2. Description of the sgRNA:Cas9 binary
vector and the expression of Cas9. A, T-DNA insertion region of the sgRNA:
Cas9 binary vector. The Arabidopsis U6-26 gene promoter (U6-26p) and
its terminator (U6 ter) were used to express sgRNA; the double 35S
promoter and the NOS terminator were used to express 3×FLAG-tagged
human or plant codon-optimized SpCas9; hygromycin was used as
the plant selection marker for the vector. NLS, nuclear localization
sequence. B, The plant codon-optimized SpCas9 showed a relatively
higher expression level than human codon-optimized SpCas9 in transgenic
plants by detecting the FLAG-tag. ACTIN was used as an internal control.

Additional file 4: Figure S3. Site-specific mutations of transgenic
plants at the target-3 with an 'A' at the start position of the 20 bp
sequence. The yellow shadow marks the target sequence recognized by
crRNA. The blue underline indicates the protospacer adjacent motif
(PAM). DNA mutations are shown in red as letters and dashes, respectively.

Additional file 5: Table S1. The detection of off-target mutations in
the transgenic plants with the target-1 sgRNA:pSpCas9. Table S2. List of
PCR primers used and their application.